PROTOLINGUISTICS: THE STUDY OF 
PROTOLANGUAGES AS AN AID 
TO GLOSSOGONIC RESEARCH 


Roger W. Wescott 


Reprinted from 
ANNALS OF THE NEW YORK ACADEMY OF SCIENCES 
Volume 280 Pages 104-116 
October 28, 1976 24695 


Department of Anthropology 
Drew University 
Madison, New Jersey 07940 


As used here, the term “protolinguistics” will mean the analytic and comparative 
study of protolanguages. Protolanguages, in turn, will mean reconstructed ancestral 
languages, ranging from such relatively firm constructs as Proto-Indic (the archaic 
Prakrit ancestral to Hindi and Bengali) to such relatively speculative constructs as 
the Universal Proto-Speech of the Upper Paleolithic Period, which I have elsewhere 
postulated as ancestral to all contemporary spoken languages.1 

Nearly all historical linguists employ the term “protolanguage,” and many cite 
hypothetical forms from reconstructed tongues. Yet a minority of these scholars deny 
that linguistic reconstructions are approximations of the actual sound and shape of 
dead languages and hold instead that such reconstructions are merely abstract 
formulae conveniently relating attested languages to one another. My own view of 
this matter is optimistically majoritarian. While I recognize that all reconstructions 
are tentative—and will argue below that most protolinguists persist in making 
avoidable reconstructive errors—I nevertheless believe that reconstructed languages 
do recapture, in large part, at least, the substance of vanished languages. As an 
example, I would cite Proto-Romance, the reconstructed ancestor of French, Italian, 
and Rumanian. As estimated by Robert Hall, its overall correspondence to recorded 
Classical Latin is about 85%.2 This congruence, I would say, is no more than we 
should expect of a diglossic situation in which the speech of literates diverges 
markedly from that of illiterates. 

No discussion of language origins can avoid the question of monogenesis, that is, 
whether all known spoken languages have a single common origin. Oddly enough, 
most linguists have taken the position that, in the absence of irrefutable evidence on 
either side of the question, the burden of proof in the matter rests with the 
monogeneticists. I say “oddly” because, as an anthropologist, I am keenly aware that 
in regard to the analogous question of the origin of life, biologists have taken 
precisely the opposite position, assuming monogenesis and assigning the burden of 
proof to polygeneticists. Regarding linguistics as a branch of anthropology and 
anthropology in turn as a branch of biology, I am inclined to follow biological 
precedent in this matter and assume monogenesis until polygenesis is demonstrated. 
This inclination is strengthened by my ethnological knowledge of the fact, for 
example, that the alphabet, although now global in extent and multifarious both in 
form and in the order of its constituent letters, originated once and only once in the 
Eastern Mediterranean region early in the first millennium B.C. 

Just as we know the vertebrates and the alphabet to have radiated and 
differentiated over space and time, we can reasonably infer, I think, that speech did 
the same thing. If this is so, we can further infer that protolanguages resembled one 
another more than contemporary languages resemble one another and that the 
earlier the protolanguages were, the more alike they were—back to the point at which 
they converged. This point I would locate in the Upper Paleolithic Period (between 
20,000 and 50,000 years ago). I place it before the subsequent Mesolithic Period 
because I believe that fewer than 20 millennia would not have produced the linguistic 
diversity we observe today; and I place it after the preceding Middle Paleolithic 
because I believe that our hominid predecessor, Neanderthal Man, though capable of 
manual language, was incapable of vocal language. 

Indirect evidence favoring this view comes from the documented history of well- 
known language families such as Indo-European. Here we find that the earliest 
recorded forms of most languages are patently more similar than are their 
contemporary forms. A particularly graphic illustration of this tendency is provided 
by Avestic, the Zoroastrian scriptural language of the ancient Iranians, and Sanskrit, 
the Hindu scriptural language of the ancient Indo-Aryans. These two languages were 
so similar that by performing relatively minor phonological transformations, one 
could readily convert an Avestic text into a Sanskrit text without any change in 
vocabulary or word order. No such ready conversion, needless to say, could be 
performed between modern Pushtu and modern Hindi. 
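
The kind of conversion described above can be pictured as a short, ordered list of rewrite rules. The following Python sketch is illustrative only: the two rules (Avestic h for Sanskrit s, Avestic ao for Sanskrit o) and the three word pairs are commonly cited textbook correspondences rather than the author's own rule set, the romanization is simplified to plain ASCII, and the function name is hypothetical.

```python
# Illustrative sketch only: two simplified Avestic-to-Sanskrit rewrite rules,
# applied in order. A realistic conversion would need many more rules and
# context-sensitive conditions.
RULES = [
    ("ao", "o"),  # assumed, simplified: Avestic ao ~ Sanskrit o
    ("h", "s"),   # assumed, simplified: Avestic h ~ Sanskrit s
]


def avestic_to_sanskrit(word):
    """Apply each rewrite rule, in order, to a romanized Avestic word."""
    for old, new in RULES:
        word = word.replace(old, new)
    return word


for avestic in ("hapta", "ahura", "haoma"):
    print(avestic, "->", avestic_to_sanskrit(avestic))
```

Vocabulary and word order are left untouched; only the segments change, which is the point of the comparison with modern Pushtu and Hindi.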

Since the Indo-European languages are the world’s most thoroughly analyzed 
language family and Proto-Indo-European is the most extensively reconstructed 
protolanguage, we will begin our protolinguistic exposition with Indo-European. I 
assume that Proto-Indo-European (hereafter PIE) was spoken by a congeries of 
neolithic tribes moving rather freely, perhaps as transhumant pastoralists, between 
central Europe and eastern Turkestan during the 3rd millennium B.C. Its system of 
23 segmental phonemes I reconstruct (following Winfred Lehmann but deviating 
from his exposition of vowels and “laryngeals”3) as follows: 


p    t    k    kʷ 
b    d    g    gʷ 
bh   dh   gh   gʷh 
     s 
m    n 
     r 
     l 
i    ə    u 
e    a    (o) 


Here it should be understood that i, ə, and u are glides rather than vowels (and hence 
that they pattern as sonorants, like the linguals and nasals) and that length is a 
prosodic feature that may combine with any nonobstruent. 

Since the discovery of Hittite in the 1920s, it has become clear (as both 
structuralists and monogeneticists had previously suspected) that pre-Indo-European 
had had at least one and possibly several dorsal obstruents. If we define 
PIE as the protolanguage ancestral to English, Spanish, Russian, and Hindi, then, 
following Edgar Sturtevant,4 we may posit an earlier protolanguage ancestral both to 
PIE and to such Anatolian languages as Hittite, Luwian, Palaic, and (perhaps) 
Armenian, and call it “Proto-Indo-Hittite” (hereafter PIH). PIH was probably 
spoken by a horticultural people in the circum-Caucasian area in the 4th millennium 
B.C. A plausible inventory of its 20 or so segmental phonemes follows: 


p        t    k    ʔ (?) 
p’ (?)   t’   k’ 
ph       th   kh 
         š    x    h (?) 
m        n 
         r 
         l 
w        y 
o        e 


Three of these phonemes are of doubtful status. These are the glottalized bilabial 
stop, which yields no certain reflex in Anatolian and only the rare voiced bilabial stop 
in PIE; and the voiceless glottal stop and the voiceless glottal fricative, which are 
absent from both Anatolian and PIE but can be inferred for their parent tongue on 
the basis of internal evidence in each. 

The firmer sound-correspondences between Anatolian and IE (the former 
illustrated exclusively from cuneiform Hittite) follow: 


Phoneme   Hittite                  Indo-European* 
p         paxur, “fire”            PIE pur- 
ph        parkuš, “pure”           bhreg-, “white” 
t         titami, “stand”          stistami 
t’        teik-, “show”            deik- 
th        tehi, “place”            dhē-, “put” 
k         kwiš, “who?”             kʷis 
k’        kenu, “knee”             genu 
kh        kwenzi, “strike”         gʷhenti 
š         šiptam-, “seventh”       sept(o)m- 
x         xanti, “in front”        anti- 
m         mal-, “grind”            mol- 
n         newaš, “new”             neuos 
r         pra-, “forth”            pro- 
l         lak-, “lie down”         logh-, “lay” 
y         yukan, “yoke”            iugom 
w         wes, “we”                ue- 
e         ešmi, “am”               esmi 
o         arpa-, “misfortune”      orbho-, “bereft” 


*IE meanings are given only when they differ from Hittite meanings. 


Once we leave Indo-Hittite, agreement about cognation between Indo-European 
and other languages falls off precipitously. Nonetheless, striking lexical correspon- 
dences persist. Outside the Anatolian family, the largest number of cognates (in 
proportion to total vocabulary) appears to come from Etruscan, a language isolate 
from pre-Roman Italy,5 which seems to have had about 20 segmental phonemes. 
The protolanguage that was ancestral to PIH and Etruscan is one that I would call 
(on the analogy of other Indo-hyphenations) Indo-Etruscan and assign to incipiently 
horticultural peoples of western Eurasia from the 5th millennium B.C. 

The more persuasive sound-correspondences between Etruscan and PIH (the 
latter illustrated from various IH languages) follow: 


(Proto-)phoneme   Etruscan               Indo-Hittite† 
P                 φel, “container”       Greek pélla, “bowl” 
T                 tecum, “earth”         Hittite tekan 
K                 cap, “take”            Old Irish gaibid 
s                 se-, “six”             Latin sex 
x                 hanθ, “in front”       Hittite xanti 
m                 man-, “think”          Sanskrit manyaté 
n                 nep-, “humid”          Greek néphos, “cloud” 
r                 tur-, “give”           Greek dõron, “gift” 
l                 lautn, “family”        Lithuanian liáudis, “people” 
y                 ecnia, “offering”      Sanskrit yajña, “sacrifice” 
w                 var-, “water”          Sanskrit vari 
e                 eme, “I”               Greek emé-ge, “me” 
a                 am-, “love”            Latin amo 


†IH meanings are given only when they differ from Etruscan meanings. 



In the preceding table, the three capital letters in the first column represent what 
Swadesh called “protophonemes”; that is, sound-classes that resemble morpho- 
phonemes in containing phonetically similar phonemes but differ from morpho- 
phonemes in that their phonemic realizations are semantically rather than 
morphologically conditioned. (As we move back in time, the number of such 
phonosemic stop variants in each articulatory position becomes increasingly 
difficult to specify. Even PIE poses problems here. Although most Indo-Europeanists 
follow the Greek model and reconstruct three manners, a substantial minority follow 
the Indic model and reconstruct four. Moreover, it remains possible that both groups 
are wrong and that we ought, instead, to follow the Slavic model and reconstruct two 
manners of articulation—or even, following the Tocharian model, only one for each 
articulatory position. Anatolian evidence is of little help in this inquiry, since Hittite 
and Lycian may be interpreted as having had either one or two manners of occlusion. 
And Etruscan, though it clearly had two manners, shows no consistent pattern of 
correspondence between these and the manners of Indo-European. Uralic, finally, 
resembles PIE in the reconstructive difficulties it poses: a majority of Uralicists posit 
only one articulatory manner for each occlusive position in Proto-Uralic, hereafter 
PU. But a minority posit three.) 

If we exclude isolates like Etruscan and seek cognation between Indo-Hittite and 
a language family of comparable complexity, we find that the family offering the 
largest number of plausible lexical correspondences is Uralic, which embraces 
(among other languages) Finnish, Hungarian, and Samoyed. The protolanguage 
that was ancestral to PIH and PU is one which I would call Indo-Uralic and assign to 
a hunting and gathering people of western Eurasia from the 6th millennium B.C. 

The more persuasive sound-correspondences between PU and Proto-Indo- 
Etruscan (the latter illustrated from PIE, PIH, and Etruscan) follow: 


Phoneme   Proto-Uralic          Indo-Etruscan† 
p         per-, “far”           PIE per-, “traverse” 
t         te-, “thou”           PIE te-, “thee” 
k         ku-, “who?”           PIH kwi- 
s         sen, “he”             PIE se-, ne- 
m         man-, “guess”         Etruscan man-, “think” 
n         nimi, “name”          PIE nEm- 
r         utar-, “udder”        PIE uTer- 
l         kol-, “pale”          PIH KEL-, “bright” 
y         kar-, “herd”          PIE ker- 
w         vet-, “water”         PIE uEd- 
e         met-, “honey”         PIE medh- 
a         kap-, “hoof”          PIE kap-, “hoof; goat” 


†Unmarked PIE & PIH meanings are as in PU. 


Beyond Uralic, no language or language-group presents large numbers of 
conspicuous lexical cognates with Indo-European. But strong, if indirect, 
genealogical links have recently been forged between IE on the one hand and 
Dravidian and Elamite on the other by Stephen Tyler7 and David McAlpin.8 More 
specifically, Tyler has demonstrated extensive lexical correspondences between PU 
and Proto-Dravidian (hereafter PD); whereas McAlpin has done the same for PD 
and the various stages of Elamite, from Proto-Elamite (in this case a hieroglyphically 
recorded rather than a reconstructed language) through Middle Elamite to 
Achaemenid Elamite. In effect, they have established a linguistic chain that links IE 
with Elamite through the intermediacy of Uralic and Dravidian, respectively. 

This genealogical chain, of course, is reminiscent of Holger Pedersen’s Nostratic 
hypothesis, according to which Indo-European, Hamito-Semitic, Ural-Altaic, 
Dravidian, and Kartvelian (or southern Caucasian) form a single linguistic 
macrophylum.9 From a monogenetic point of view, the shortcoming of the Nostratic 
hypothesis is not that it is false—since all spoken languages are necessarily cognate— 
but rather that it is misleading, at least insofar as it implies that Eurasian languages 
have something in common with one another that they do not have in common with 
African, Oceanian, and American languages. For it is virtually certain that the 
reasons why the kinship of Indo-European with Semitic and Uralic is so widely 
perceived are, first, that the regional contiguity of the three families makes their 
cognation relatively easy to believe in and, second, that the thoroughness with which 
they have been analyzed makes it comparatively easy to pick out shared lexical items 
among them. 

As a discipline, comparative protolinguistics, it should be admitted, remains 
relatively weak. But this weakness, I believe, is due far more to its underdeveloped 
state than to any inherent inadequacy in either linguistic reconstruction or in 
comparative methodology. To strengthen comparative protolinguistics, I rec- 
ommend at least two methodological reforms. The first of these is utilization, by 
comparativists, not only of lexicons in the conventional sense—that is, of inventories 
of free morphemes or “words”—but also of what I call “morphicons,” or inventories 
of bound morphemes or affixes.10 In fact, I would go further and assert that 
morphicons should not be used merely to supplement lexicons; I should say, rather, 
that wherever possible, morphicons should be used in preference to lexicons, and that 
it is the lexicons which should be regarded as evidentially secondary. 

My reason for preferring morphicons to lexicons is that affixes are far less often 
borrowed than are words. In the case, for example, of the relation of Indo-European 
to Uralic, skeptics can always argue that lexemes like PU met-, “honey,” and utar-, 
“udder,” were loanwords from PIE. But they find it much harder to dismiss such verb 
suffixes as the PU/PIE preterit -s- or agential -nt- in the same way.11 

The second methodological reform that I would propose for comparative 
protolinguistics (as for comparative linguistics generally) is abandonment of the 
widespread practice of using Morris Swadesh’s lexicostatistic protocols as devices for 
demonstrating interlanguage cognation. Although I yield to none in my admiration 
for Swadesh as a linguistic prehistorian, I find that it is insufficiently appreciated by 
linguists and anthropologists that the primary purpose of glottochronology as a 
technique is to specify degrees of lexical divergence between languages whose kinship 
is established or generally assumed. Were this not so, it would be pointless to 
stipulate, as Swadesh did, that such obvious cognates as English hound and German 
Hund may not be lexicostatistically equated because of their recent loss of synonymy. 
Between languages whose kinship is in doubt, such pairs as hound and Hund—not to 
mention others whose divergence is more pronounced—must be coupled if cognation 
is ever to be demonstrated. 

For this reason, I prefer to compare protolanguages and other languages of 
uncertain mutual relationship in terms of what I call the “semantic cluster” principle. 
This principle is based on recognition of the fact that many of the terms in Swadesh’s 
and other word-lists comprise or (more often) help comprise relatively self-contained 
semantic subsystems. Among these subsystems, or semantic clusters, are personal 
pronouns, numerals, kinship terms, names of body parts, and color terms. Of the 
clusters just cited, pronouns are probably the most persistent, both because they are 
oldest (as evidenced by their greater proneness to become bound forms) and because 
they are least subject to borrowing.12 

Semantic clusters, however, must be used with care. For each is a subsystem 
composed of infrasystems with characteristics that are sometimes disparate. Thus, 
third-person pronouns, for example, are less persistent than others because they are 
more likely to interdigitate or merge with demonstratives. High numbers are less 
persistent than low numbers because high numbers are more subject to borrowing. 
Nursery terms for mother and father, on the other hand, are persistent to excess: that 
is, they are apparently universal,13 a fact that makes them useless for subgrouping 
purposes. Terms for excretory or procreative organs may be subject to verbal taboos 
that lead either to deformation or to borrowing. And terms for subtle hues or shades 
are especially likely to be loanwords. 

In addition to such major semantic clusters as kinship systems and color 
terminologies, there are also minor clusters consisting of two or three forms each and 
constituting antonymous polarities. These antonymies may be either dipolar, as in 
the case of the dyad this vs. that, or tripolar, as in the case of the triad male vs. female 
vs. neuter. Minor clusters tend to be more persistent when they fall within the ambit 
of major clusters, as in the case of the dipolarity I vs. thou or the tripolarity white vs. 
black vs. red. 

Lexical persistence, finally—at least insofar as it can be detected with assurance— 
consists not only of semantic persistence but also of phonic persistence. Yet, 
surprisingly few explicit studies have been made by historically oriented 
investigators of the relative diachronic durability of phonemes or of distinctive 
features. The only one that I can locate was made sixteen years ago by a jurist and 
philologist named Arthur Diamond. Diamond, having surveyed the Indo-European, 
Semitic, Polynesian, and Bantu languages, concluded that the overall order of 
decreasing consonant persistences is as follows: 


1. nasals 
2. voiceless stops 
3. voiced stops 
4. voiceless fricatives 
5. voiced fricatives 
6. linguals14 


For vowels, Diamond’s order of persistence is: 


1. low central vowels 
2. high front and back vowels (including glides) 
3. all other vowels14 


And, for obstruent and nasal articulatory positions, his order of persistence is: 


1. labial 
2. apical 
3. dorsal, 


although he notes that, among stops, apicals are more persistent than labials.14 

If Diamond’s analysis is sustained when a larger number of language families is 
surveyed, it would seem that the labial nasal /m/ (which is found in nearly all 
languages outside the Salishan and Iroquoian groups15) is the most widely resistant 
of all phonemes to diachronic change, presumably because of the great visual and 
tactile palpability of labials and the distinctiveness of nasality, which is the only 
nonoral consonantal manner. 
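
The inference just drawn combines two of the rankings quoted above, one for manner and one for position. A minimal Python sketch of that combination follows. Adding the two ranks is an assumption made purely for illustration (it is not Diamond's procedure), the sample consonants are arbitrary, and the sketch ignores the noted exception that apical stops outlast labial stops.

```python
# Ranks follow the orders of persistence quoted above (1 = most persistent).
MANNER_RANK = {
    "nasal": 1,
    "voiceless stop": 2,
    "voiced stop": 3,
    "voiceless fricative": 4,
    "voiced fricative": 5,
    "lingual": 6,
}
POSITION_RANK = {"labial": 1, "apical": 2, "dorsal": 3}


def persistence_score(manner, position):
    """Assumed scoring: sum of the two ranks; a lower score means more persistent."""
    return MANNER_RANK[manner] + POSITION_RANK[position]


# Arbitrary sample consonants, each described as (manner, position).
consonants = {
    "m": ("nasal", "labial"),
    "n": ("nasal", "apical"),
    "k": ("voiceless stop", "dorsal"),
    "z": ("voiced fricative", "apical"),
}

most_persistent = min(consonants, key=lambda c: persistence_score(*consonants[c]))
print(most_persistent)
```

Under this scoring the labial nasal comes out ahead of every other sample consonant, which matches the conclusion in the text; a different way of combining the ranks could change the ordering lower down the list.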

In reconstructing the phonemic inventories of protolanguages, it is, of course, 
necessary to do as most Indo-Europeanists have done, namely, to postulate a 
sufficiently large number of phonemes to account for the diversity of reflexes in all 
attested daughter languages. But there is a real danger that in so doing, protolinguists 
will overdifferentiate the aboriginal phonemes and produce an inventory that is 
intrinsically improbable. By this I mean not so much that they will produce a total 
number of phonemes that is incredibly high (although this may in fact happen) as 
that they may produce numerical proportions between types of phonemes that 
violate linguistic universals of the type now being established by linguists at Stanford 
University.16 

An example of such dubious reconstruction is provided by Björn Collinder in his 
Comparative Grammar of the Uralic Languages, in which he postulates 23 consonant 
phonemes for PU.17 What is implausible here is not the total inventory but the fact 
that Collinder posits more sibilants than stops, more nasals than stops, and even 
more linguals than stops in PU, despite the fact that “implicative universals” 
expressly veto all three of these proportions.18 
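
The three proportions at issue can be stated as a simple check on a reconstructed inventory. The Python sketch below is a hedged illustration: the function name is hypothetical, and the class counts are invented for the example and are not Collinder's figures, which the text does not itemize.

```python
def proportion_violations(inventory):
    """Return the consonant classes that outnumber stops in an inventory.

    `inventory` maps class names to phoneme counts. The three comparisons
    mirror the proportions named in the text (sibilants, nasals, and
    linguals each exceeding stops).
    """
    stops = inventory["stops"]
    return [
        cls
        for cls in ("sibilants", "nasals", "linguals")
        if inventory[cls] > stops
    ]


# Invented counts for illustration only (not taken from any reconstruction).
suspect = {"stops": 3, "sibilants": 4, "nasals": 4, "linguals": 5}
plausible = {"stops": 8, "sibilants": 2, "nasals": 3, "linguals": 2}

print(proportion_violations(suspect))
print(proportion_violations(plausible))
```

A check of this kind flags an inventory for a second look; it does not by itself show that the reconstruction is wrong.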

On the other hand, we should not fall into the alternative error of assuming that 
the phonological and other traits typical of contemporary languages must have been 
typical of all or even most languages throughout prehistory. To begin with, it seems 
inherently improbable that speech sprang fully fledged from either a closed hominoid 
call-system19 or an open proto-hominid gesture language.20 Whatever its source, 
speech almost certainly developed by stages, even if its evolution was discontinuous 
in pace.21 

More precisely, there is evidence that the rate of diachronic sound change (of the 
type described by such formulae as Grimm’s Law) accelerated steadily from the 
Upper Paleolithic Period till the spread of literacy following urbanization. This 
evidence consists chiefly of the striking genetic and typological resemblances between 
protolanguages, which one could scarcely expect to encounter if sound change had 
always been as rapid as it evidently was during the last few millennia B.C. 

Another persistent and even more unidirectional trend has been the steady 
growth of what George Trager calls “microlanguage”—that is, language that is 
grammatically structured to a high degree—at the expense of all other domains of 
what he calls “macrolanguage.”22 The three other domains of Trager’s macro- 
language are what he calls “prelanguage,” “paralanguage,” and “metalanguage.” 
Prelanguage consists of primate calls and infant babblings, paralanguage of 
extragrammatical interjection and speech-deformation, and metalanguage of song 
and other verbal or verbalized arts. Although Trager himself has not proposed any 
unitary term for the three nonmicrolinguistic domains of macrolanguage, my own 
term for this threefold collectivity is “allolanguage.”23 I describe allolanguage as a 
language that is alien to the type of speech and writing most easily generated or 
analyzed by Transformational methods. 

The divergence between microlanguage and allolanguage, moreover, is one that 
seems to me to be capable of accounting for some of the difficulties linguists 
experience in reconstructing protolanguages. In the case of PIE/PIH, two such 
difficulties are the apparently sporadic occurrence in it of geminated stops and of 
i ~ a ~ u vowel apophony. Examples of the former are: PIE pipp-, “to peep”; atta, 
“dad”; and bhukkos, “stag.” Examples of the latter are kin- ~ kan- ~ kun-, “dog.” 
Although PIE gemination was probably solely “expressive” and hence allolinguistic, 
contemporaneous Dravidian geminates were phonotactically “normal,” and hence 
microlinguistic. And the PIE vocoid apophony, which was probably also 
allolinguistic (and remains so in English echoic sequences like clink, clank, clunk), 
became microlinguistic in Proto-Germanic (and remains so in English verbal 
paradigms like drink, drank, drunk). 

A third long-range linguistic trend has been the emergence of vowels, first from 
subphonemic to phonemic status and then from a position of rough parity with 
consonants to one of segmental primacy. The chief evidence for a prehistoric 
situation in which only consonants were phonemic is the fact that even today there 
are languages, such as Wishram (a Chinookan Penutioid vernacular of Oregon), that 
can be analyzed as being univocalic, and systemically, therefore, vowelless.24 And 
there are many other languages—perhaps as many as 10% of contemporary 
vernaculars—that have fewer than the five to twelve vowel phonemes that 
characterize a majority of the world’s languages. These include Abkhazi, a Northwest 
Caucasian language, which has two vowels; Iatmul, an Indo-Pacific language of 
New Guinea, which has three vowels; and Manobo, an Austronesian language of the 
Philippines, which has four vowels. What is striking about these languages is that 
nearly all of them come from such refuge areas as mountain ranges, small islands, 
polar regions, or isolated continents. This fact, in turn, dovetails nicely with the 
observation that protolanguages generally seem to have had fewer vowel phonemes 
than their daughter languages. Proto-Semitic, for example, probably had only three 
microlinguistic vowels, and PIH only two. 

The sense, on the other hand, in which vowels may be said to have reached not 
only parity with consonants but primacy among segmentals is that most contempo- 
rary languages require vowels in all syllables and permit both syllables and words 
that consist of vowels only. 

A fourth long-range linguistic trend is one that I call “optimization of phonemic 
inventories.” By this I mean not only that the size of phonemic inventories, like the 
pace of sound change, seems to have reversed its evolutionary direction but also that, 
in the case of phoneme numbers, there seems to have been (in effect) an effort to 
strike an optimal mean between the extremes of too few and too many phonemes. 
Among living languages, phonemic inventories run from a minimum of about 10 to a 
maximum of about 100 segmental phonemes, with an average of 25 to 30. We may 
assume, I think, that a lack of oral dexterity kept Upper Paleolithic phonemic 
inventories small. By the Mesolithic Period, however, it is likely that increasing 
dexterity had led to inventories that averaged numbers near the contemporary upper 
limit. Evidence for this phonemic amplitude comes both from typological surveys of 
contemporary languages and from reconstructive work in protolinguistics. In the 
former case, we note that the phonemic inventories of many “refuge area” 
languages—ranging in location from Africa’s Kalahari Desert to North American 
Indian reservations—have segmentals ranging in number from more than 40 to more 
than 100. In the latter, we note that, particularly in the case of the so-called 
“laryngeals” of Indo-Hittite, many of the peculiarities of the historical languages can 
be most readily explained by assuming the existence in PIH of dorsal obstruents that 
were subsequently lost. 

This observation leads directly to identification of a fifth and final long-range 
phonological trend in the evolution of speech. Like the preceding trend, it is 
curvilinear and optimizing, rather than rectilinear and maximizing, in nature. This 
fifth trend is a shift of positional primacy initially from labiality to dorsality and later 
from dorsality to apicality. Both the study of human speech ontogeny and that of 
primate phonation suggest labial primacy, probably because the lips are visually and 
tactilely the most accessible of speech organs. Later, however, when the consonantal 
inventory expanded, expansion apparently occurred primarily in the postapical or 
dorsal region. Evidence for this inference comes from refuge-area languages, where 
we find a strikingly high proportion of clicks, pharyngeals, and glottalized 
obstruents. Vestiges of both Upper Paleolithic labiality and Mesolithic dorsality may 
well be detectable in scattered survivals of labial trills and various glottalics among 
the world’s languages, the latter being commoner than the former because of the 
greater recency of dorsal than of labial dominance. 

Today, however, an overwhelming majority of the world’s languages show apical 
dominance either in the sense that gaps in their obstruent systems are usually 
preapical or postapical rather than apical or in the sense that their sonorant 
systems (and especially their lingual subsystems) are primarily or exclusively 
apical. This development, if not an optimizing one, is certainly a medianizing one. 

Thus far, we have written as though the problem of reconstructing protolan- 
guages were minor, and the only serious question that of relating protolanguages to 
one another. But in fact this is not so. Although nearly all linguists, for example, 
concede the genealogical unity of the Semitic, Egyptian, Berber, Cushitic, Omotic, 
and Chadic language groups, they have proved unable as yet to reconstruct Proto- 
Afro-Asian, the presumed parent language of the entire family. The same is true of 
Proto-Congo-Kordofanian, the presumed parent language of the Atlantic, Voltaic, 
Adamawa, Kwa, Bantoid, and Kordofanian groups. The chief reason for these 
lacunae is the predominant irregularity of sound correspondences between the 
recorded languages of these groups. 

More precisely, what linguists have discovered is that, although exact phonemic 
correspondences between most African languages cannot be established, lexical 
cognations can, because corresponding phonemes in related languages usually share 
phonetic features. In other words, sound correspondences in these and other 
language families of great internal time depth must be formulated either 
subphonemically, in terms of articulatory or acoustic features, or transphonemically, 
in terms of morphophonemes. 

An example of such nonphonemic sound correspondences is provided by the 
Niger-Kordofanian lexeme for “goat”: 


Group      Language       Form 
Atlantic   Fulani         be’- 
Voltaic    Mossi          bū- 
Kwa        Yoruba         ewurɛ 
           Igede          emf” 
Ijaw       Proto-Ijaw     òbóri 
Bantoid    Tiv            ivo 
           Jukun          bina 
           Proto-Bantu    buli 


Here the familial proto-form should probably be written as a noun-base, -BEN-, 
where B represents any voiced labial consonant, E any non-low vowel, and N 
any apical sonorant. The reason why we cannot (and should not) be more precise 
than this is that the cross-language correspondences b ~ b ~ v ~ m ~ w, 
ine~we ~o~ uy wu, and n ~ r ~ l in other lexemes do not occur in a consistent 
and predictable manner but rather in a capricious and unpredictable manner. 
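
A feature-level proto-form such as -BEN- behaves like a pattern over sound classes rather than over individual phonemes. The Python sketch below illustrates this under stated assumptions: the class memberships are limited to the segments that appear in the forms cited above, the forms are reduced to plain ASCII with tone marks and length dropped, and the function name is hypothetical.

```python
import re

# Assumed, simplified class memberships (ASCII only).
VOICED_LABIALS = "bvmw"    # B: any voiced labial consonant
NON_LOW_VOWELS = "ieou"    # E: any non-low vowel
APICAL_SONORANTS = "nrl"   # N: any apical sonorant

BEN = re.compile(
    f"[{VOICED_LABIALS}][{NON_LOW_VOWELS}][{APICAL_SONORANTS}]"
)


def matches_ben(form):
    """True if the form contains a B-E-N sequence anywhere within it."""
    return BEN.search(form) is not None


# ASCII renderings of the cited forms that preserve all three segments.
forms = {
    "Proto-Bantu": "buli",
    "Jukun": "bina",
    "Proto-Ijaw": "obori",
    "Yoruba": "ewure",
}

for language, form in forms.items():
    print(language, form, matches_ben(form))
```

Shorter forms such as Tiv ivo show only the B and E segments, so they fall outside this strict three-segment pattern; matching them would require treating N as optional, which the sketch leaves out.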

It is, however, unlikely that there was never any reason for these phonic 
variations. Instead, it is probable that alternations of this kind are remnants of a 
nongrammatical but semantically systematic type of apophony. In the case of the 
labial series, the opposition of stop and fricative is paralleled by such English pairings 
as pooh ~ fooey, that of stop and nasal by English pa ~ ma, and that of stop and 


Wescott: Protolinguistics 113 


glide by English pow ~ wow. Although the putatively aboriginal meanings of these 
labial sound contrasts remain obscure, those of the apical contrasts are partially 
discernible on the basis of onomastic patterns that persist in modern languages. In 
such pairs as Sarah ~ Sally and even more in such triads as Henry ~ Harry ~ Hal, 
we observe an incompletely fossilized phonosemic system of augmentation and 
diminution, in which the lateral diminishes, the vibrant augments, and the nasal 
maintains neutrality of size. Outside the domain of proper names, the same threefold 
opposition may be preserved in English hound ~ horse ~ whelp, if we assume that 
horses were first thought of if not as oversized dogs, at least as weighty coursers.26 

In addition to fossilized patterns of apophony, archaic languages probably also 
had patterns of affixation that likewise serve to obscure regular sound correspon- 
dences across languages. Among these are sonorant infixations of two types: 
prenuclear (occurring before the syllabic peak) and postnuclear (occurring after the 
syllabic peak). Although they have evidently ceased to be microlinguistically 
productive, they remain productive in allolanguage. Examples from PIE micro- 
language and English allolanguage follow: 


Optional infix   PIE                            English
(E)i             pek- ~ peik-, “to fleece”      grip ~ gripe
(E)u             bhegt- ~ bheug-, “to flee”     crackle ~ croak
l(E)             kEu- ~ klEu-, “to hear”        beep ~ bleep
r(E)             bheg- ~ bhreg-, “to break”     caw ~ crow
(E)n             ghed- ~ ghend-, “to take”      click ~ clink
n(E)             megh- ~ men(e)gh-, “large”     kick ~ knock 27
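
The two infix positions can be sketched as string operations. This is a minimal illustration under stated assumptions: the function name `insert_infix` is hypothetical, forms are written without their trailing hyphen, the first run of vowel letters is treated as the syllabic peak, and English spelling is not modeled.

```python
VOWELS = set("aeiouE")  # E is the cover symbol for a non-low vowel


def insert_infix(base, infix, position):
    """Insert a sonorant infix next to the first syllabic peak.

    position="pre"  places the infix immediately before the peak;
    position="post" places it immediately after the peak.
    """
    # Locate the first vowel run, treated here as the syllabic peak.
    start = next((i for i, ch in enumerate(base) if ch in VOWELS), None)
    if start is None:
        raise ValueError(f"no syllabic peak found in {base!r}")
    end = start
    while end < len(base) and base[end] in VOWELS:
        end += 1
    if position == "pre":
        return base[:start] + infix + base[start:]
    if position == "post":
        return base[:end] + infix + base[end:]
    raise ValueError("position must be 'pre' or 'post'")


print(insert_infix("bheg", "r", "pre"))   # bhreg
print(insert_infix("kEu", "l", "pre"))    # klEu
print(insert_infix("pek", "i", "post"))   # peik
print(insert_infix("ghed", "n", "post"))  # ghend
```

The sketch reproduces only those pairs in the table whose two members differ by a single inserted segment.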


Consonantal apophony and sonorant infixation, whether acting sequentially or in 
concert, may easily obscure ancient cognations that would otherwise be patent. An 
example of apophonic obscuration is the formal contrast between PIH kan-, Proto- 
Cushitic kar-, and Proto-Semitic kal-, all meaning “dog.” And an example of 
infixative obscuration is the treble contrast between PIE kuon- and PU koir-, also 
meaning “dog,” in which PIE exhibits prenuclear infixation, selection of an 
augmentative glide, and preservation of a neutral sonorant, but PU exhibits 
postnuclear infixation, selection of a diminutive glide, and substitution of an 
augmentative sonorant. (It is possible, of course, to reformulate the distinction 
between apophony and infixation as a distinction between two different kinds of 
infixation: replacive in the case of kan- ~ kar- ~ kal- and additive in the case of 
kuon- ~ koir- ~ koN-.) 
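
Replacive infixation of the kan- ~ kar- ~ kal- kind can be sketched as substitution of the final apical sonorant. The size values follow the onomastic pattern described earlier (nasal neutral, vibrant augmentative, lateral diminutive); the function name `apophonic_variants` is hypothetical, and the sketch makes no claim about the actual meanings of the attested forms.

```python
# Size values assigned in the text to the three apical sonorants.
SIZE_GRADES = {"n": "neutral", "r": "augmentative", "l": "diminutive"}


def apophonic_variants(base):
    """Return the size-graded variants of a base ending in an apical sonorant."""
    stem = base.rstrip("-")
    if not stem or stem[-1] not in SIZE_GRADES:
        raise ValueError(f"{base!r} does not end in an apical sonorant")
    # Swap the final sonorant for each member of the series.
    return {
        grade: stem[:-1] + sonorant + "-"
        for sonorant, grade in SIZE_GRADES.items()
    }


print(apophonic_variants("kan-"))
# {'neutral': 'kan-', 'augmentative': 'kar-', 'diminutive': 'kal-'}
```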

Yet, whether or not we define apophony as a type of infixation, resorting to it as 
an explanatory device does not solve all the problems of protolinguistics. On the 
contrary, apophony may create new problems, the most conspicuous of which is 
ambiguity. In the realm of vowel alternation, we know that apophony may be either 
horizontal (as in the case of English gleam ~ gloom) or vertical (as in the case of 
English squish /skwiš/ ~ squash /skwaš/).29 The same is true in the realm of 
consonant alternation (as in the cases of English grip ~ grab and damp ~ dank, 
respectively).30 If carried to its logical extreme, this principle obviously allows us, 
ultimately, to equate any speech sound with any other speech sound by resorting to 
horizontal, vertical, or, combining the two, diagonal apophony (of the kind that 
occurs in English squeak ~ squawk or Betty ~ Bessy).31 

My tentative solution to this problem hinges on the observation that even today, 
most consonant alternation is vertical rather than horizontal (presumably because 
the average consonant inventory has more manners than positions), whereas most 
vowel alternation is horizontal rather than vertical (presumably, again, because the 
typical five-vowel inventory can readily be arranged in a front-to-back sequence but 
not in a high-to-low one). If we may assume that archaic consonantal apophony was 
wholly vertical and that later nonvertical alternations were due to analogizing 
feedback from the subsequently developed vowel system, our problem is reduced to 
one of separating evolutionary linguistic strata. 

A similar problem arises with regard to the question of distinguishing early base 
consonants from infix consonants. This problem persists in some contemporary 
forms, such as English chirp, which can be derived, with equal plausibility, either 
from chip (with a lingual infix) or from chirr (with an occlusive suffix). In most cases, 
to be sure, such alternative etyma will not spring to hand. But this fact attests only to 
fortuitous defects in the linguistic record, not to a lack of aboriginal morphological 
ambiguity. Perhaps, however, the earliest UPS bases consisted exclusively of 
obstruents, sonorants having been restricted to infixes. And it may be, too, that when 
external affixes came to outnumber internal affixes, suffixes were wholly or 
predominantly syllabic (specifically, diphonemic, consisting of a vowel followed by 
a consonant). Either way, protolinguistic etymologies would be partially if not wholly 
disambiguated. 

Etymological ambiguity of a slightly different kind besets the linguist who seeks 
to establish cognations between protolanguages. For, as protolanguages 
become increasingly similar the further back one traces them, it becomes 
increasingly difficult to distinguish inherited from borrowed vocabulary in any given 
protolanguage. The protolinguist’s problem here is a little like that of the historian of 
English who, though he has no difficulty isolating French elements in Middle 
English, is sometimes uncertain whether to ascribe non-Romanic elements to Old 
English or to Old Norse, since the latter two were often indistinguishable. The 
protolinguist’s problem, however, is of course the more difficult, since he has no 
documentary evidence to assist him. An example of this kind of difficulty is the 
obvious identity of PIE gʷou-, PU ku-, Proto-Turkic ko-, Proto-Bantu -go, and 
Sumerian gu-, all meaning “cattle.” If the implied meaning was “domesticates,” the 
chances are that all these protoforms were loans from Sumerian or a nearby 
language; but if the implied meaning was merely “bovines,” the chances are that most 
if not all of them are cognates. 

A special case of this difficulty is supplied by prehistoric pidginization. If, for 
example, Proto-Germanic was in fact an Indo-European pidgin spoken as a second 
language by native speakers of Uralic, one could argue either that its vocabulary was 
borrowed from Indo-European or that its structure was borrowed from Uralic. 

A particularly puzzling example of apparent cognation is provided by a North 
American language. In Lake Miwok, a Penutian language of northern California, the 
personal verb inflections are: 


Person    Singular    Plural
1st       -m          -mas
2nd       -s          -tos
3rd       -Ø          -p


If we assume that Miwok -p and PIE -nt are alternative assimilations of an earlier 
*-mt (as we assume that the PIE accusative plural ending -ns is an assimilation 
of earlier *-ms), then the correspondence between Lake Miwok and Indo-Euro- 
pean becomes almost unbelievably close. Few can say whether this closeness is due to 
a special link between Indo-European and the American Indian languages, as Mary 
Le Cron Foster thinks;32 to prehistoric Pacific crossings from the Old World to the 
New, as various diffusionists suggest;33 or to the kind of improbable coincidence that 
Carl Jung called “synchronicity.” But, whichever it is, this is the kind of detailed 
transfamilial parallel that arrests our attention and invites us to rethink linguistic 
prehistory. 

Some protolinguistic problems, finally, are due almost exclusively to lack of 
information. One such lacuna is the question of unidirectionality in sound change. It 
is, for example, assumed by most historical linguists that palatalized velar stops readily 
become prepalatal affricates or sibilants and that sibilants readily become dorsal 
fricatives (as in Sanskrit śatam, “hundred,” from PIE kmtom, or Classical Greek 
hepta, “seven,” from PIE septm) but that the reverse changes rarely, if ever, occur. 
Unfortunately, however, no broad empirical survey of the subject has (to the best of 
my knowledge) ever been made. And until it is made, we will be compelled to make 
do with impression and speculation. Let us hope that conscientious collection of 
evidence may soon remove us from this limbo. 


NOTES AND REFERENCES 


1. WESCOTT, R. W., Ed. 1974. Language Origins: Chap. 6, The Origin of Speech: 115-116. 
Linstok Press. Silver Spring, Md. 

2. HALL, R. A., Jr. 1974. An External History of the Romance Languages. The 
Reconstruction of Proto-Romance: 16-17. American Elsevier. New York, N.Y. 

3. LEHMANN, W. P. 1952. Proto-Indo-European Phonology: 99. University of Texas Press. 
Austin, Texas. 

4. STURTEVANT, E. H. 1942. The Indo-Hittite Laryngeals. Linguistic Society of America. 
Baltimore, Md. 

5. Etruscan or a closely related vernacular was apparently also spoken on the Aegean island 
of Lemnos. Vladimir I. Georgiev (Introduzione alla Storia delle Lingue indeuropee, 
Edizioni dell’Ateneo, Rome, 1966) believes that Etruscan was a later form of Hittite. 

6. CARNOY, A. 1955. Dictionnaire Etymologique du Proto-Indo-Européen. Université de 
Louvain. Belgium. 

7. TYLER, S. A. 1968. Dravidian and Uralian: The lexical evidence. Language 44(4). 

8. McALPIN, D. W. 1974. Toward Proto-Elamo-Dravidian. Language 50(1): 89-101. 

9. PEDERSEN, H. 1903. Tuerkische Lautgesetze. Zeitschrift der Deutschen Morgenlaen- 
dischen Gesellschaft. Band 57: 560. 

10. WESCOTT, R. W. A Bini morphicon. In Linguistic and Literary Studies in Honor of 
Archibald A. Hill. Edgar Polomé, Ed. Pieter de Ridder Press. Lisse, The Netherlands. 
In press. 

11. KERNS, J. C. 1967. Eurasiatic Pronouns and the Indo-Uralic Question. Fairborn, Ohio. 

12. DYEN, I., A. T. JAMES & J. W. L. COLE. 1967. Language divergence and estimated word 
retention rate. Language 43(1): 150-171. 

13. MURDOCK, G. P. 1959. Cross-language parallels in parental kin terms. Anthropological 
Linguistics 1(9): 1-5. 

14. DIAMOND, A. S. 1959. The History and Origin of Language: Chap. 15, The Earliest 
Words, 197-199. Philosophical Society. New York, N.Y. 

15. HOCKETT, C. F. 1955. A Manual of Phonology. (Memoir 11.) Int. J. of Am. Linguistics 
21(4): 19. 

16. GREENBERG, J. H., et al. (Committee on Linguistics.) 1969—. Working Papers on 
Language Universals, Stanford University. Stanford, Calif. 

17. COLLINDER, B. 1960. A Comparative Grammar of the Uralic Languages. Almqvist and 
Wiksell. Stockholm, Sweden. 

18. GREENBERG, J. H. 1968. Anthropological Linguistics: An Introduction: Chap. 8, 
Synchronic Universals. Random House. New York, N.Y. 

19. HOCKETT, C. F. 1959. Animal “languages” and human language. In The Evolution of 
Man’s Capacity for Culture. James N. Spuhler, Ed. Wayne State University Press. 
Detroit, Mich. 

20. HEWES, G. W. 1973. Primate communication and the gestural origin of language. Current 
Anthropol. 14(1): 5-24. 


21. WESCOTT, R. W. 1967. The Evolution of Language: Re-opening a closed subject. Studies 
in Linguistics 19: 67-82. 

22. TRAGER, G. L. 1955. Language. In Encyclopaedia Britannica. Chicago, Ill. 

23. WESCOTT, R. W. 1975. Allolinguistics: exploring the peripheries of speech. In The Second 
LACUS Forum. Adam Makkai and Valerie Becker Makkai, Eds. Hornbeam Press. 
Columbia, S.C. 

24. HOCKETT, C. F. 1963. The problem of universals in language. In Universals of Language. 
Joseph H. Greenberg, Ed. M.I.T. Press. Cambridge, Mass. 

25. WINTER, W., Ed. 1965. Evidence for Laryngeals. Mouton. The Hague, The Netherlands. 

26. For fuller treatment of the subject of archaic apophony, see WESCOTT, R. W., 1974, Types 
of apophony in proto-speech. In Language Origins, Roger W. Wescott, Ed. Linstok 
Press. Silver Spring, Md. 

27. The initial k of knock is still pronounced in Scottish English. In American English, it is 
also preserved in the allolog konk. 

28. Although most Indo-Hittite languages exhibit reflexes of the form kuon-, an allolog kan- 
is indicated by Latin canis, Middle Irish cano, “wolf-pup,” and Thracian kandaon and 
Lydian kandaules, “dog-slayer.” 

29. WESCOTT, R. W. 1970. Types of vowel alternation in English. Word 26(3). 

30. WESCOTT, R. W. 1971. Labio-velarity and derogation in English: a study in phonosemic 
correlation. American Speech 46(2). 

31. Here the nickname Beth is assumed to stand in the vertically apophonic relation to the 
allolog Betty. 

32. FOSTER, M. Le C. 1971. American Indian and Old World languages. Presented at ann. 
mtg. Amer. Anthropological Assn. New York, N.Y. 

33. Notably George F. Carter, Edwin Doran, Jr., and Jonathan D. Sauer. 1971. In Man 
Across the Sea: Problems of Pre-Columbian Contacts. Carroll L. Riley and J. Charles 
Kelley, Eds. University of Texas Press. Austin, Texas. 


